Γ-MINIMAX AND MINIMAX DECISION RULES
FOR COMPARISON OF TREATMENTS WITH A CONTROL*
by 


Shanti  S.  Gupta  and  Woo-Chul  Kim 
Purdue  University 


1. Introduction.

In  many  fields  of  research  one  is  faced  with  the  problem  of  comparing 
k experimental  categories  with  reference  to  a 'standard'  or  a 'control'. 
Following  the  initial  investigation  by  Paulson  (1952),  this  problem  has 
been  studied  in  several  different  formulations  by  Dunnett  (1955),  Gupta  and 
Sobel  (1958)  and  Lehmann  (1961)  among  others. 

Let $\pi_1, \ldots, \pi_k$ denote the k experimental categories or 'treatment'
populations and let $\pi_0$ denote the 'control' population, where the quality
of each population is characterized by a real-valued parameter $\theta_i$
(i = 0, 1, ..., k). Each treatment population $\pi_i$ is said to be 'superior',
'equivalent' or 'inferior' to the control population $\pi_0$ if $\theta_i - \theta_0 \ge \Delta$,
$-\Delta < \theta_i - \theta_0 < \Delta$, $\theta_i - \theta_0 \le -\Delta$, respectively, where $\Delta$ is a given positive
constant.  We  consider  a problem  in  which  the  treatment  populations  are  to 
be  classified  as  one  of  the  above  three  cases  based  on  the  observations  from 
the  populations.  Bhattacharyya  (1956,  1958)  studied  this  problem  for  the 
normal  populations  with  unknown  means  when  the  control  population  is 
assumed  known.  A similar  problem  has  been  considered  by  Seeger  (1972). 

We apply the Γ-minimax principle to this problem.


*This research was supported by the Office of Naval Research contract
N00014-75-C-0455 at Purdue University. Reproduction in whole or in part
is  permitted  for  any  purpose  of  the  United  States  Government. 
The Γ-minimax principle is known as one of the techniques for the use of
incomplete prior information. Such an idea was first used by Robbins (1951)
and independently by Hodges and Lehmann (1952) and Menges (1966). The name
Γ-minimax was first used by Blum and Rosenblatt (1967). Randles and Hollander
(1971) applied such a principle to a problem of selecting the treatments
'better' than the control. It has been applied to various problems, and
recently to selection problems by Gupta and Huang (1975, 1977), Berger (1977)
and Miescke (1979).

In Section 2, necessary notations, definitions, a loss function and
the incomplete prior are introduced. A lemma is given to help find Γ-minimax
rules. Section 3 treats the case of known control population, and a Γ-minimax
rule and a minimax rule are derived. In Section 4, the case in which the
control population parameter is unknown is treated. Rules are derived
which are Γ-minimax among rules for which the decision about the i-th
population depends only on the observations from $\pi_i$ and $\pi_0$. A minimax rule
is also derived. A normal means problem and a normal variances problem are
given as specific examples. Section 5 consists of comparisons of Γ-minimax
rules with Bayes rules for independent normal priors for the normal means
problem.

2.  Formulation  of  the  problem. 

Let $X_0, X_1, \ldots, X_k$ be k+1 independent random variables representing the
control population $\pi_0$ and the k treatment populations $\pi_1, \ldots, \pi_k$, respectively,
where $X_i$ has pdf $f_i(x - \theta_i)$ (i = 0, 1, ..., k); these are the observations on
which we base our statistical decisions. We assume that each $f_i(\cdot)$ (i = 0, 1, ..., k)
is symmetric about the origin and strongly unimodal, i.e., $f_i(\cdot)$ is log-concave on
the real line. Hence $f_i(x - \theta_i)$ has the monotone likelihood ratio (MLR) property.
Obviously, we do not need any observations from $\pi_0$ when $\theta_0$ is assumed known;
therefore, it will be understood that, in such a case, the random variable $X_0$
is deleted from our consideration.

The action space $\mathcal{A}$ can be written as $\mathcal{A} = \mathcal{A}_1 \times \cdots \times \mathcal{A}_k$ where $\mathcal{A}_i = \{1,2,3\}$
for i = 1, ..., k. The action $a = (a_1, \ldots, a_k) \in \mathcal{A}$ is to be interpreted in such
a way that, for i = 1, ..., k, the treatment population $\pi_i$ is classified as
'inferior', 'equivalent' and 'superior' to $\pi_0$ for $a_i = 1, 2, 3$, respectively.
The loss $L(\theta,a)$ incurred by the action $a \in \mathcal{A}$ for $\theta = (\theta_0, \theta_1, \ldots, \theta_k)$ is assumed to
be of the following form.

\[ L(\theta,a) = \sum_{i=1}^{k} L_i(\theta,a_i) \tag{2.1} \]

where $L_i(\theta,a_i)$ is defined as in the following table:

Table of loss $L_i(\theta,a_i)$


Here, $\Delta_1 = \Delta - \epsilon$, $\Delta_2 = \Delta + \epsilon$ for a given constant $\epsilon$: $0 \le \epsilon < \Delta$, and it will be
understood that the second row and the fourth row will disappear when $\epsilon = 0$.
Bhattacharyya (1956) derived a minimax rule assuming the above loss function
with $\ell_1 = \ell_2 = \ell_1 + \ell_3 = 1$ and $\epsilon = 0$ when $\theta_0$ is assumed known and $\theta_1, \ldots, \theta_k$ are
the unknown means of normal distributions. However, the irregularity of such
a loss function has been pointed out in the sense that the minimax risk does not
tend to zero even if the sample sizes increase indefinitely, and the same
problem has been studied afresh by Bhattacharyya (1958) assuming a loss
function of the above form with $\epsilon > 0$. Note that the above loss
function with $\epsilon > 0$ assumes the indifference zones
$\Delta_1 \le |\theta_i - \theta_0| < \Delta_2$.


For given $x = (x_0, x_1, \ldots, x_k)$ consider decision rules of the form

\[ \delta(x) = (\delta_1(x), \ldots, \delta_k(x)) \tag{2.2} \]

where $\delta_i(x) = (\delta_i(1|x), \delta_i(2|x), \delta_i(3|x))$ and, for j = 1, 2, 3, $\delta_i(j|x)$
denotes the conditional probability of taking action j in the i-th component
decision problem. Note that there is no loss of generality in considering
decision rules of the form given in (2.2). The risk function of a rule
$\delta$ for fixed $\theta$ is then $R(\theta,\delta) = \sum_{i=1}^{k} R_i(\theta,\delta_i)$ where $R_i(\theta,\delta_i) = E_\theta[L_i(\theta,\delta_i(X))]$.
For a prior distribution $\tau(\theta)$ of $\theta$, the overall risk of a rule $\delta$ wrt $\tau$ is
denoted by $r(\tau,\delta) = \sum_{i=1}^{k} r_i(\tau,\delta_i)$ where $r_i(\tau,\delta_i) = \int R_i(\theta,\delta_i)\,d\tau(\theta)$.

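It is assumed that partial prior information is available to a decision
maker such that, for each i, he can specify $\gamma_i = P[|\theta_i - \theta_0| \ge \Delta_2]$ and
$\gamma_i' = P[|\theta_i - \theta_0| < \Delta_1]$ where $\gamma_i + \gamma_i' \le 1$ for i = 1, ..., k. Let Γ denote the
class of all such prior distributions, i.e.,

\[ \Gamma = \Big\{ \tau(\theta) : \int_{|\theta_i-\theta_0| \ge \Delta_2} d\tau(\theta) = \gamma_i,\ \int_{|\theta_i-\theta_0| < \Delta_1} d\tau(\theta) = \gamma_i' \ \text{ for } i = 1, \ldots, k \Big\}. \tag{2.3} \]

Note that when $\epsilon = 0$, i.e., $\Delta_1 = \Delta_2$, $\gamma_i + \gamma_i' = 1$.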
A rule $\delta^\Gamma$ is called a Γ-minimax rule if

\[ \sup_{\tau\in\Gamma} r(\tau,\delta^\Gamma) = \inf_{\delta} \sup_{\tau\in\Gamma} r(\tau,\delta), \]

and $\sup_{\tau\in\Gamma} r(\tau,\delta^\Gamma)$ is called the Γ-minimax value. The next result is useful
to find the Γ-minimax rule.

Lemma 2.1. Suppose $\{\tau_n, n = 1, 2, \ldots\}$ is a sequence of priors in Γ.
If $\lim_n \inf_\delta r(\tau_n,\delta) \ge c$ and if $\sup_{\tau\in\Gamma} r(\tau,\delta^\Gamma) \le c$, then $\delta^\Gamma$ is a Γ-minimax
rule and c is the Γ-minimax value.

Proof. The result follows from the following inequalities.

\[ \sup_{\tau\in\Gamma} \inf_\delta r(\tau,\delta) \ge \lim_n \inf_\delta r(\tau_n,\delta) \ge c \ge \sup_{\tau\in\Gamma} r(\tau,\delta^\Gamma) \ge \inf_\delta \sup_{\tau\in\Gamma} r(\tau,\delta) \ge \sup_{\tau\in\Gamma} \inf_\delta r(\tau,\delta). \]


3.  Known  control  population 

In this section $\theta_0$ is assumed known and thus we may assume $\theta_0 = 0$ without
loss of generality. Hence x and $\theta$ in this section denote $(x_1, \ldots, x_k)$ and
$(\theta_1, \ldots, \theta_k)$, respectively. Let us consider a rule $\delta(x)$ of the form in (2.2)
where $\delta_i(j|x)$ (j = 1, 2, 3) is given by

\[ \delta_i(1|x) = I_{(-\infty,-d_i]}(x_i), \quad \delta_i(2|x) = I_{(-d_i,d_i)}(x_i), \quad \delta_i(3|x) = I_{[d_i,\infty)}(x_i), \tag{3.1} \]

for $0 \le d_i \le \infty$ and i = 1, ..., k.
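Lemma 3.1. Suppose that a decision rule $\delta(x)$ is given by (2.2) and (3.1).
Then, for i = 1, ..., k,

\[ \sup_{\tau\in\Gamma} r_i(\tau,\delta_i) \le v_i \]

where

\[ v_i = \int_{d_i}^{\infty} \big[\ell_3\gamma_i f_i(x+\Delta_2) + \ell_2\gamma_i'(f_i(x-\Delta_1)+f_i(x+\Delta_1)) + \ell_4(1-\gamma_i-\gamma_i') f_i(x+\Delta_1)\big]\,dx + \int_{-\infty}^{d_i} \ell_1\gamma_i f_i(x-\Delta_2)\,dx. \]

Proof. It follows from the definition of $L_i(\theta,a_i)$ and the symmetry of $f_i(\cdot)$
that, for $|\theta_i| < \Delta_1$,

\[ R_i'(\theta,\delta_i) = \ell_2[f_i(d_i-\theta_i) - f_i(-d_i-\theta_i)] = \ell_2 f_i(\theta_i+d_i)\Big[\frac{f_i(\theta_i-d_i)}{f_i(\theta_i+d_i)} - 1\Big] \]

where $R_i'$ denotes the derivative of $R_i$ wrt $\theta_i$.

It follows from the MLR property of $f_i(x-\theta_i)$ that $R_i'(\theta,\delta_i)$ has at most
one change of sign, from negative to positive if there is any sign change
at all; therefore, $R_i(\theta,\delta_i)$ attains the supremum over $\theta_i \in (-\Delta_1,\Delta_1)$ at
either $\theta_i = -\Delta_1$ or $\theta_i = \Delta_1$. Hence, for $|\theta_i| < \Delta_1$,

\[ R_i(\theta,\delta_i) \le \ell_2 \int_{d_i}^{\infty} [f_i(x-\Delta_1)+f_i(x+\Delta_1)]\,dx. \]

It can be easily shown that

\[ R_i(\theta,\delta_i) \le \begin{cases} \ell_1 \int_{-\infty}^{d_i} f_i(x-\Delta_2)\,dx + \ell_3 \int_{d_i}^{\infty} f_i(x+\Delta_2)\,dx & \text{for } |\theta_i| \ge \Delta_2, \\ \ell_4 \int_{d_i}^{\infty} f_i(x+\Delta_1)\,dx & \text{for } \Delta_1 \le |\theta_i| < \Delta_2. \end{cases} \]

Therefore, it follows from (2.3) that $\sup_{\tau\in\Gamma} r_i(\tau,\delta_i) \le v_i$, which completes
the proof.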

Now we derive a Γ-minimax rule for the case where $\theta_0$ is known.

Theorem 3.1. Assume that independent random variables $X_1, \ldots, X_k$ have
pdf's $f_1(x_1-\theta_1), \ldots, f_k(x_k-\theta_k)$, respectively, with $f_i(\cdot)$ being symmetric and
strongly unimodal, and that the loss function is given by (2.1). Then
the Γ-minimax rule $\delta^\Gamma$ is given by (2.2) and (3.1) where each $d_i = d_i^\Gamma$ in
(3.1) is defined by $d_i^\Gamma = \max(c_i, 0)$ with $c_i$ being determined by

\[ \ell_3\gamma_i f_i(x+\Delta_2) + \ell_2\gamma_i'(f_i(x-\Delta_1)+f_i(x+\Delta_1)) + \ell_4(1-\gamma_i-\gamma_i') f_i(x+\Delta_1) \ <,\ >\ \ell_1\gamma_i f_i(x-\Delta_2) \quad \text{as } x >,\ < c_i. \tag{3.2} \]
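As a numerical illustration of Lemma 3.1 and condition (3.2), the following Python sketch takes $f_i$ to be the standard normal density. That choice, the function names `bound_v` and `cutoff_32`, the search bracket [-10, 10] and the parameter values are all assumptions made for illustration; the theorem itself covers any symmetric, strongly unimodal $f_i$. The sketch locates $c_i$ by bisection on the difference of the two sides of (3.2) and evaluates the bound $v_i$ in closed form.

```python
import math

def pdf(x):
    # Standard normal density, used here as an illustrative f_i.
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def cdf(x):
    # Standard normal cdf.
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def bound_v(d, g, gp, D1, D2, l1, l2, l3, l4):
    # v_i of Lemma 3.1 for normal f_i; the integral of f(x + a)
    # over [d, inf) equals cdf(-d - a).
    upper = (l3 * g * cdf(-d - D2)
             + l2 * gp * (cdf(-d + D1) + cdf(-d - D1))
             + l4 * (1.0 - g - gp) * cdf(-d - D1))
    return upper + l1 * g * cdf(d - D2)

def cutoff_32(g, gp, D1, D2, l1, l2, l3, l4, lo=-10.0, hi=10.0):
    # Left side minus right side of (3.2): positive below c_i, negative above.
    def diff(x):
        lhs = (l3 * g * pdf(x + D2)
               + l2 * gp * (pdf(x - D1) + pdf(x + D1))
               + l4 * (1.0 - g - gp) * pdf(x + D1))
        return lhs - l1 * g * pdf(x - D2)
    if not (diff(lo) > 0.0 > diff(hi)):
        raise ValueError("no sign change inside the bracket")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if diff(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return max(0.5 * (lo + hi), 0.0)   # d_i = max(c_i, 0)

# Hypothetical inputs: gamma = 0.3, gamma' = 0.5, Delta_1 = 0.8, Delta_2 = 1.2.
params = dict(g=0.3, gp=0.5, D1=0.8, D2=1.2, l1=1.0, l2=1.0, l3=1.0, l4=1.0)
d_gamma = cutoff_32(**params)
print(d_gamma, bound_v(d_gamma, **params))
```

Since the derivative of $v_i$ with respect to $d_i$ is the right side of (3.2) minus the left side, the cutoff found this way is also the minimizer of the bound over $d_i \ge 0$, which is how the lemma and the theorem fit together.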

Proof. The existence of a $c_i$ satisfying (3.2) follows from the MLR
property of $f_i(x-\theta_i)$. Therefore, the decision rule $\delta^\Gamma$ is well defined.

First, we will consider the case when $\epsilon > 0$, i.e., $\Delta_2 > \Delta_1$. For $n > \Delta_1^{-1}$,
let $\tau_n$ be a prior distribution in Γ under which $\theta_1, \ldots, \theta_k$ are independent,
$P(\theta_i=\Delta_2) = P(\theta_i=-\Delta_2) = \gamma_i/2$, $P(\theta_i=\Delta_1) = P(\theta_i=-\Delta_1) = (1-\gamma_i-\gamma_i')/2$ and
$P(\theta_i=\Delta_1-n^{-1}) = P(\theta_i=-\Delta_1+n^{-1}) = \gamma_i'/2$ for i = 1, ..., k. Then it can be
easily verified that $\inf_\delta r(\tau_n,\delta) = \sum_{i=1}^{k} \inf_{\delta_i} r_i(\tau_n,\delta_i)$ and, for i = 1, ..., k,

\[ \inf_{\delta_i} r_i(\tau_n,\delta_i) = \int_{-\infty}^{\infty} p_n(x)\,dx/2, \]

where $p_n(x) = \min\{p_n(1,x),\ p(2,x),\ p_n(1,-x)\}$ with $p(2,x) = \ell_1\gamma_i[f_i(x+\Delta_2) + f_i(x-\Delta_2)]$
and $p_n(1,x) = \ell_2\gamma_i'[f_i(x-\Delta_1+n^{-1}) + f_i(x+\Delta_1-n^{-1})] + \ell_4(1-\gamma_i-\gamma_i') f_i(x-\Delta_1) + (\ell_1+\ell_3)\gamma_i f_i(x-\Delta_2)$.
Since $f_i(\cdot)$ is strongly unimodal on the real line, $f_i(\cdot)$ is
continuous and thus $p_n(x)$ converges, as $n \to \infty$, to $p(x) = \min\{p(1,x),\ p(2,x),\ p(1,-x)\}$
where $p(1,x) = \lim_{n\to\infty} p_n(1,x)$. Note that $p(1,x) \ge p(1,-x)$ if and only
if $x \ge 0$. This follows from the fact that, for any $t > 0$, $f_i(x-t) \ge f_i(x+t)$
if and only if $x \ge 0$. Since $p_n(x)$ is bounded above by $p(2,x)$ which is
integrable, it follows from the Lebesgue convergence theorem that

\[ \lim_n \inf_{\delta_i} r_i(\tau_n,\delta_i) = \int_{-\infty}^{\infty} p(x)\,dx/2 = \int_0^{\infty} \min\{p(2,x),\ p(1,-x)\}\,dx. \]

oo 

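Note that $\int_0^{\infty} \min\{p(2,x),\ p(1,-x)\}\,dx$ can be written as

\[ \begin{aligned} \int_0^{\infty} \min\{p(2,x),\ p(1,-x)\}\,dx &= \int_0^{\infty} \min\{\ell_3\gamma_i f_i(x+\Delta_2) + \ell_4(1-\gamma_i-\gamma_i') f_i(x+\Delta_1) + \ell_2\gamma_i'[f_i(x+\Delta_1)+f_i(x-\Delta_1)],\ \ell_1\gamma_i f_i(x-\Delta_2)\}\,dx + \int_{-\infty}^{0} \ell_1\gamma_i f_i(x-\Delta_2)\,dx \\ &= \int_{d_i^\Gamma}^{\infty} \big[\ell_3\gamma_i f_i(x+\Delta_2) + \ell_4(1-\gamma_i-\gamma_i') f_i(x+\Delta_1) + \ell_2\gamma_i'(f_i(x-\Delta_1)+f_i(x+\Delta_1))\big]\,dx + \int_{-\infty}^{d_i^\Gamma} \ell_1\gamma_i f_i(x-\Delta_2)\,dx, \end{aligned} \]

where $d_i^\Gamma = \max(0, c_i)$ with $c_i$ defined as in (3.2).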

It follows from Lemma 3.1 that $\lim_n \inf_{\delta_i} r_i(\tau_n,\delta_i) \ge \sup_{\tau\in\Gamma} r_i(\tau,\delta_i^\Gamma)$. Therefore,

\[ \lim_n \inf_\delta r(\tau_n,\delta) = \lim_n \sum_{i=1}^{k} \inf_{\delta_i} r_i(\tau_n,\delta_i) \ge \sum_{i=1}^{k} \sup_{\tau\in\Gamma} r_i(\tau,\delta_i^\Gamma) \ge \sup_{\tau\in\Gamma} r(\tau,\delta^\Gamma). \]

Hence Lemma 2.1 yields that $\delta^\Gamma$ is a Γ-minimax rule. This completes the proof
of the case when $\epsilon > 0$. Note that $\Delta_1 = \Delta_2 = \Delta$ and $\gamma_i + \gamma_i' = 1$ for i = 1, ..., k
if $\epsilon = 0$. When $\epsilon = 0$, let us consider a sequence of prior distributions,
$\tau_n$, $n > \Delta^{-1}$, in Γ under which $\theta_1, \ldots, \theta_k$ are independent, $P(\theta_i=\Delta) = P(\theta_i=-\Delta) = \gamma_i/2$
and $P(\theta_i=\Delta-n^{-1}) = P(\theta_i=-\Delta+n^{-1}) = \gamma_i'/2$ for i = 1, ..., k. Then we can
prove in exactly the same manner as above that $\delta^\Gamma$ is a Γ-minimax rule.

Now  we  discuss  the  derivation  of  the  minimax  rule  for  some  special  cases. 

A minimax rule can be derived from the arguments in the proof of Theorem
3.1. For this purpose, assume that $\ell_1 = \ell_2$, $\ell_4 \le 2\ell_1$ and $\ell_3 \le \ell_1$. We may
assume that $\ell_1 = \ell_2 = 1$ without loss of generality. Let us consider a rule
$\delta^*$ of the type given by (2.2) and (3.1) where each $d_i = d_i^*$ in (3.1) is
determined so that, for $F_i(x) = \int_{-\infty}^{x} f_i(t)\,dt$,

\[ F_i(d_i-\Delta_2) + \ell_3 F_i(-d_i-\Delta_2) = F_i(-d_i-\Delta_1) + F_i(-d_i+\Delta_1). \tag{3.3} \]

Note that the existence of such a non-negative $d_i^*$ follows from the strong
unimodality and the symmetry of $f_i(\cdot)$. Let us define $\gamma_i$ and $\gamma_i' = 1-\gamma_i$ for
i = 1, ..., k by

\[ \gamma_i = \frac{f_i(d_i^*-\Delta_1) + f_i(d_i^*+\Delta_1)}{f_i(d_i^*-\Delta_2) - \ell_3 f_i(d_i^*+\Delta_2) + f_i(d_i^*-\Delta_1) + f_i(d_i^*+\Delta_1)}. \]

Since $\gamma_i \in [0,1]$, we can consider a family of prior distributions, Γ, given
by (2.3). Then it follows from Theorem 3.1 that the corresponding Γ-minimax
rule is of the same type as $\delta^*$ except that now $d_i = \max(c_i, 0)$ where $c_i$ is
determined so that

\[ H(c_i) = \gamma_i[\ell_3 f_i(c_i+\Delta_2) - f_i(c_i-\Delta_2)] + \gamma_i'[f_i(c_i-\Delta_1) + f_i(c_i+\Delta_1)] = 0. \]

Since $H(d_i^*) = 0$ and $d_i^* \ge 0$, $d_i^\Gamma = d_i^*$, i.e., the rule $\delta^*$ is the Γ-minimax
rule; therefore it follows from the arguments in the proof of Theorem 3.1
that

\[ \begin{aligned} \lim_n \inf_\delta r(\tau_n,\delta) &\ge \sum_{i=1}^{k} \big\{\gamma_i[F_i(d_i^*-\Delta_2) + \ell_3 F_i(-d_i^*-\Delta_2)] + \gamma_i'[F_i(-d_i^*-\Delta_1) + F_i(-d_i^*+\Delta_1)]\big\} \\ &= \sum_{i=1}^{k} [F_i(-d_i^*-\Delta_1) + F_i(-d_i^*+\Delta_1)] \\ &= \sum_{i=1}^{k} \sup_\theta R_i(\theta,\delta_i^*) \\ &\ge \sup_\theta R(\theta,\delta^*). \end{aligned} \]


Therefore,  we  have  the  next  result  which  includes  the  results  in  Bhattacharyya 
(1956,  1958)  as  special  cases. 


Corollary 3.1. Under the assumptions in Theorem 3.1, if $\ell_1 = \ell_2 = 1$, $\ell_3 \le 1$
and $\ell_4 \le 2$, then a rule of the type given by (2.2) and (3.1) with $d_i = d_i^M$
in (3.1) being determined by (3.3) is minimax.
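The construction behind Corollary 3.1 can be checked numerically. The sketch below assumes a standard normal $f_i$ and $\ell_1 = \ell_2 = 1$; the function names and the values $\Delta_1 = 0.8$, $\Delta_2 = 1.2$, $\ell_3 = 1$ are hypothetical. It solves the equalizer equation (3.3) by bisection, forms the weight $\gamma_i$ defined after (3.3), and confirms that $H(d_i^*)$ vanishes at that weight.

```python
import math

def pdf(x):
    # Standard normal density (illustrative f_i).
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def cdf(x):
    # Standard normal cdf (illustrative F_i).
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def minimax_cutoff(D1, D2, l3, hi=20.0):
    # Solve F(d - D2) + l3 F(-d - D2) = F(-d - D1) + F(-d + D1) for d >= 0.
    def gap(d):
        return (cdf(d - D2) + l3 * cdf(-d - D2)
                - cdf(-d - D1) - cdf(-d + D1))
    lo = 0.0   # gap(0) = (1 + l3) F(-D2) - 1 < 0 when l3 <= 1
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def least_favorable_weight(d, D1, D2, l3):
    # gamma_i as defined after (3.3); gamma_i' = 1 - gamma_i.
    num = pdf(d - D1) + pdf(d + D1)
    den = pdf(d - D2) - l3 * pdf(d + D2) + num
    return num / den

D1, D2, l3 = 0.8, 1.2, 1.0
d_star = minimax_cutoff(D1, D2, l3)
g = least_favorable_weight(d_star, D1, D2, l3)
# H(d*) from the text, evaluated at the weights (g, 1 - g).
H = (g * (l3 * pdf(d_star + D2) - pdf(d_star - D2))
     + (1.0 - g) * (pdf(d_star - D1) + pdf(d_star + D1)))
print(d_star, g, H)
```

The left side of (3.3) is the risk at $|\theta_i| = \Delta_2$ and the right side is the largest risk over the equivalence zone, so the root equalizes the two worst cases.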


4.  Unknown  control  population. 

In this section we will consider the case when $\theta_0$ is unknown and
will derive a Γ-minimax decision rule $\delta^\Gamma$ in the class $\mathcal{D}_0$ of decision
rules for which $\delta_i(x)$ in (2.2) depends only on $x_0$ and $x_i$ for i = 1, ..., k.
Let us consider rules $\delta(x)$ in $\mathcal{D}_0$ where $\delta_i(j|x)$ (j = 1, 2, 3) are given by

\[ \delta_i(1|x) = I_{(-\infty,-d_i]}(x_i-x_0), \quad \delta_i(2|x) = I_{(-d_i,d_i)}(x_i-x_0), \quad \delta_i(3|x) = I_{[d_i,\infty)}(x_i-x_0), \tag{4.1} \]

for $0 \le d_i \le \infty$ and i = 1, ..., k.

Note that the pdf of $Y_i = X_i - X_0$ is given by

\[ g_i(y-(\theta_i-\theta_0)) = \int_{-\infty}^{\infty} f_i(t+y-\theta_i)\, f_0(t-\theta_0)\,dt, \tag{4.2} \]

and that $g_i(\cdot)$ is strongly unimodal by the result of Ibragimov (1956) and
symmetric about the origin. Therefore, the next lemma follows from this fact and
Lemma 3.1.
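The density of the difference $X_i - X_0$ is a convolution integral, which is easy to check numerically in a special case. The sketch below assumes both densities are centered normals (an illustrative choice, with hypothetical standard deviations 1.0 and 0.5) and approximates the integral by the trapezoidal rule; in that case the result should match the $N(0, s_i^2 + s_0^2)$ density and be symmetric about the origin.

```python
import math

def normal_pdf(x, sd):
    # Density of N(0, sd^2).
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))

def diff_pdf(y, sd_i, sd_0, half_width=12.0, steps=4000):
    # Trapezoidal approximation of g_i(y) = integral of f_i(t + y) f_0(t) dt,
    # taking theta_i = theta_0 = 0.
    a = -half_width * sd_0
    h = 2.0 * half_width * sd_0 / steps
    total = 0.0
    for j in range(steps + 1):
        t = a + j * h
        w = 0.5 if j in (0, steps) else 1.0
        total += w * normal_pdf(t + y, sd_i) * normal_pdf(t, sd_0)
    return total * h

s = math.sqrt(1.0 ** 2 + 0.5 ** 2)
for y in (-1.5, 0.0, 0.7, 2.0):
    print(y, diff_pdf(y, 1.0, 0.5), normal_pdf(y, s))
```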


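Lemma 4.1. Suppose that a rule $\delta(x)$ in $\mathcal{D}_0$ is given by (2.2) and (4.1).
Then, for i = 1, ..., k,

\[ \sup_{\tau\in\Gamma} r_i(\tau,\delta_i) \le \omega_i, \]

where, for i = 1, ..., k,

\[ \omega_i = \int_{d_i}^{\infty} \big[\ell_3\gamma_i g_i(y+\Delta_2) + \ell_2\gamma_i'(g_i(y-\Delta_1)+g_i(y+\Delta_1)) + \ell_4(1-\gamma_i-\gamma_i') g_i(y+\Delta_1)\big]\,dy + \int_{-\infty}^{d_i} \ell_1\gamma_i g_i(y-\Delta_2)\,dy. \]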


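We now proceed as in Theorem 3.1 by considering the following sequence
$\{\tau_n, n > \Delta_1^{-1}\}$ of prior distributions in Γ for the case when $\epsilon > 0$. Under $\tau_n$,

(i) $\theta_1-\theta_0, \ldots, \theta_k-\theta_0$ are independent,

(ii) $P[\theta_i-\theta_0 = \Delta_2] = P[\theta_i-\theta_0 = -\Delta_2] = \gamma_i/2$,
$P[\theta_i-\theta_0 = \Delta_1] = P[\theta_i-\theta_0 = -\Delta_1] = (1-\gamma_i-\gamma_i')/2$,
$P[\theta_i-\theta_0 = \Delta_1-n^{-1}] = P[\theta_i-\theta_0 = -\Delta_1+n^{-1}] = \gamma_i'/2$ and

(iii) $\theta_0$ has uniform distribution over $[-n,n]$ and is independent of $\theta_i-\theta_0$ (i = 1, ..., k).

It can be easily shown that the overall risk of the Bayes rule is given by

\[ \inf_{\delta\in\mathcal{D}_0} r(\tau_n,\delta) = \sum_{i=1}^{k} \frac{1}{4n} \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} p_n(i,x,y)\,dx\,dy \tag{4.3} \]

where $p_n(i,x,y) = \min\{s_n(i,x,y),\ t_n(i,x,y),\ s_n(i,-x,-y)\}$ with

\[ \begin{aligned} s_n(i,x,y) &= \ell_2\gamma_i' \int_{-n}^{n} [f_i(x-u-\Delta_1+n^{-1}) + f_i(x-u+\Delta_1-n^{-1})]\, f_0(y-u)\,du \\ &\quad + \ell_4(1-\gamma_i-\gamma_i') \int_{-n}^{n} f_i(x-u-\Delta_1)\, f_0(y-u)\,du \\ &\quad + (\ell_1+\ell_3)\gamma_i \int_{-n}^{n} f_i(x-u-\Delta_2)\, f_0(y-u)\,du \quad \text{and} \\ t_n(i,x,y) &= \ell_1\gamma_i \int_{-n}^{n} [f_i(x-u+\Delta_2) + f_i(x-u-\Delta_2)]\, f_0(y-u)\,du. \end{aligned} \]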

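From the change of variables $x = nv-w$ and $y = nv+w$, it follows that

\[ \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} p_n(i,x,y)\,dx\,dy/4n = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} p_n(i,nv-w,nv+w)\,dv\,dw/2 \ \ge\ \int_{-\infty}^{\infty} \Big[\int_{-1}^{1} p_n(i,nv-w,nv+w)\,dv\Big]\,dw/2. \tag{4.4} \]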
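The factor in the change of variables can be verified directly. The short derivation below is added as a worked step and uses only the substitution stated in the text:

```latex
\[
x = nv - w,\quad y = nv + w
\;\Longrightarrow\;
\left|\frac{\partial(x,y)}{\partial(v,w)}\right|
= \left|\det\begin{pmatrix} n & -1 \\ n & 1 \end{pmatrix}\right| = 2n,
\qquad
\frac{1}{4n}\,dx\,dy = \frac{1}{2}\,dv\,dw .
\]
```

Because $p_n \ge 0$, restricting the inner integral to $v \in (-1,1)$ can only decrease the value, which gives the inequality in the same display.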

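Note that

\[ \begin{aligned} s_n(i,nv-w,nv+w) &= \ell_2\gamma_i' \int_{n(v-1)}^{n(v+1)} [f_i(z-w-\Delta_1+n^{-1}) + f_i(z-w+\Delta_1-n^{-1})]\, f_0(z+w)\,dz \\ &\quad + \ell_4(1-\gamma_i-\gamma_i') \int_{n(v-1)}^{n(v+1)} f_i(z-w-\Delta_1)\, f_0(z+w)\,dz \\ &\quad + (\ell_1+\ell_3)\gamma_i \int_{n(v-1)}^{n(v+1)} f_i(z-w-\Delta_2)\, f_0(z+w)\,dz \quad \text{and} \\ t_n(i,nv-w,nv+w) &= \ell_1\gamma_i \int_{n(v-1)}^{n(v+1)} [f_i(z-w+\Delta_2) + f_i(z-w-\Delta_2)]\, f_0(z+w)\,dz. \end{aligned} \]

Therefore, for any $(v,w) \in (-1,1) \times R$, $p_n(i,nv-w,nv+w)$ converges, as $n \to \infty$,
to $p(i,w) = \min\{s(i,w),\ t(i,w),\ s(i,-w)\}$ where

\[ \begin{aligned} s(i,w) &= \ell_2\gamma_i' \int_{-\infty}^{\infty} [f_i(z-w-\Delta_1) + f_i(z-w+\Delta_1)]\, f_0(z+w)\,dz \\ &\quad + \ell_4(1-\gamma_i-\gamma_i') \int_{-\infty}^{\infty} f_i(z-w-\Delta_1)\, f_0(z+w)\,dz \\ &\quad + (\ell_1+\ell_3)\gamma_i \int_{-\infty}^{\infty} f_i(z-w-\Delta_2)\, f_0(z+w)\,dz \quad \text{and} \\ t(i,w) &= \ell_1\gamma_i \int_{-\infty}^{\infty} [f_i(z-w+\Delta_2) + f_i(z-w-\Delta_2)]\, f_0(z+w)\,dz. \end{aligned} \]

It follows from (4.4) that, for i = 1, ..., k,

\[ \liminf_n \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} p_n(i,x,y)\,dx\,dy/4n \ \ge\ \int_{-\infty}^{\infty} p(i,w)\,dw = \int_0^{\infty} h(i,y)\,dy, \]

where $h(i,y) = \min\{\ell_2\gamma_i'[g_i(y-\Delta_1)+g_i(y+\Delta_1)] + \ell_4(1-\gamma_i-\gamma_i') g_i(y+\Delta_1) + (\ell_1+\ell_3)\gamma_i g_i(y+\Delta_2),\ \ell_1\gamma_i[g_i(y+\Delta_2)+g_i(y-\Delta_2)]\}$.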
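Then from (4.3), we have

\[ \liminf_n \inf_{\delta\in\mathcal{D}_0} r(\tau_n,\delta) \ \ge\ \sum_{i=1}^{k} \int_0^{\infty} h(i,y)\,dy. \]

Note that $\int_0^{\infty} h(i,y)\,dy$ can be written as

\[ \begin{aligned} \int_0^{\infty} h(i,y)\,dy &= \int_0^{\infty} \min\{\ell_3\gamma_i g_i(y+\Delta_2) + \ell_4(1-\gamma_i-\gamma_i') g_i(y+\Delta_1) + \ell_2\gamma_i'(g_i(y+\Delta_1)+g_i(y-\Delta_1)),\ \ell_1\gamma_i g_i(y-\Delta_2)\}\,dy + \int_{-\infty}^{0} \ell_1\gamma_i g_i(y-\Delta_2)\,dy \\ &= \int_{d_i}^{\infty} \big[\ell_3\gamma_i g_i(y+\Delta_2) + \ell_2\gamma_i'(g_i(y-\Delta_1)+g_i(y+\Delta_1)) + \ell_4(1-\gamma_i-\gamma_i') g_i(y+\Delta_1)\big]\,dy + \int_{-\infty}^{d_i} \ell_1\gamma_i g_i(y-\Delta_2)\,dy, \end{aligned} \tag{4.5} \]

where $d_i = \max(c_i, 0)$ with $c_i$ being determined so that

\[ \ell_3\gamma_i g_i(y+\Delta_2) + \ell_2\gamma_i'(g_i(y-\Delta_1)+g_i(y+\Delta_1)) + \ell_4(1-\gamma_i-\gamma_i') g_i(y+\Delta_1) \ <,\ >\ \ell_1\gamma_i g_i(y-\Delta_2) \quad \text{as } y >,\ < c_i. \tag{4.6} \]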


Let $\delta^\Gamma$ be the rule given by (2.2) and (4.1) where $d_i = d_i^\Gamma$ in (4.1) is
defined by $d_i^\Gamma = \max(c_i, 0)$ with $c_i$ determined by (4.6). Then it follows
from (4.5) and Lemma 4.1 that

\[ \liminf_n \inf_{\delta\in\mathcal{D}_0} r(\tau_n,\delta) \ \ge\ \sum_{i=1}^{k} \sup_{\tau\in\Gamma} r_i(\tau,\delta_i^\Gamma) \ \ge\ \sup_{\tau\in\Gamma} r(\tau,\delta^\Gamma). \]

Therefore, Lemma 2.1 yields the next result.


Theorem 4.1. Assume that independent random variables $X_0, \ldots, X_k$ have
pdf's $f_0(x_0-\theta_0), \ldots, f_k(x_k-\theta_k)$, respectively, with $f_i(\cdot)$ being strongly
unimodal and symmetric, and that the loss function is given by (2.1). Then
the Γ-minimax rule $\delta^\Gamma$ in $\mathcal{D}_0$ is given by (2.2) and (4.1) where $d_i = d_i^\Gamma$ in
(4.1) is defined by $d_i^\Gamma = \max(c_i, 0)$ with $c_i$ being determined by (4.6),
for all i = 1, 2, ..., k.

Remark 4.1. It can be easily shown that the symmetry of $f_i(\cdot)$ in Theorem
4.1 can be replaced by that of $g_i(\cdot)$. It should be noted that the symmetry
of $g_i(\cdot)$ follows when $f_0(\cdot), \ldots, f_k(\cdot)$ are identical.





The next result follows in exactly the same manner as Corollary 3.1
was proved.

Corollary 4.1. Under the assumptions in Theorem 4.1, if $\ell_1 = \ell_2 = 1$,
$\ell_3 \le 1$ and $\ell_4 \le 2$, then a minimax rule $\delta^M$ in $\mathcal{D}_0$ is given by (2.2) and (4.1)
where $d_i = d_i^M$ in (4.1) is determined so that, for $G_i(x) = \int_{-\infty}^{x} g_i(t)\,dt$,

\[ G_i(d_i-\Delta_2) + \ell_3 G_i(-d_i-\Delta_2) = G_i(-d_i-\Delta_1) + G_i(-d_i+\Delta_1). \]

Now  we  provide  some  examples  to  illustrate  the  application  of  the  above 
results. 


Example 4.1. Suppose $\pi_i$ represents a normal population $N(\theta_i, \sigma_i^2)$ for
i = 0, ..., k with $\sigma_i^2$ (i = 0, ..., k) known. We assume that a random sample
of size $n_i$ is taken from each of the k+1 populations $\pi_0, \ldots, \pi_k$. By
sufficiency we can restrict our attention to the decision rules depending
only on the sample means $\bar X_0, \ldots, \bar X_k$ where $\bar X_i$ has normal distribution with
mean $\theta_i$ and variance $\eta_i^2 = \sigma_i^2/n_i$ for i = 0, 1, ..., k.


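(A) Γ-minimax rule: The Γ-minimax rule $\delta^\Gamma$ in $\mathcal{D}_0$ in Theorem 4.1 is
determined by $d_i^\Gamma = (\eta_i^2 + \eta_0^2)^{1/2} \max(c_i, 0)$, with $\eta_i^2 = \sigma_i^2/n_i$ the variance of $\bar X_i$,
where $c_i$ is defined so that

\[ \ell_3\gamma_i e^{-2(\lambda_i+\epsilon_i)x} + \ell_2\gamma_i'\big[e^{-2\epsilon_i(x-\lambda_i)} + e^{-2\lambda_i(x-\epsilon_i)}\big] + \ell_4(1-\gamma_i-\gamma_i') e^{-2\lambda_i(x-\epsilon_i)} - \ell_1\gamma_i \ >,\ <\ 0 \quad \text{as } x <,\ > c_i, \tag{4.7} \]

where $\lambda_i = \Delta(\eta_i^2+\eta_0^2)^{-1/2}$ and $\epsilon_i = \epsilon(\eta_i^2+\eta_0^2)^{-1/2}$.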

(B) Minimax rule: Assume $\ell_1 = \ell_2 = 1$, $\ell_3 \le 1$ and $\ell_4 \le 2$. Then the
minimax rule $\delta^M$ in $\mathcal{D}_0$ in Corollary 4.1 is determined by $d_i^M = (\eta_i^2+\eta_0^2)^{1/2} t_i$
where $t_i$ is defined so that

\[ \Phi(t_i-\lambda_i-\epsilon_i) + \ell_3\Phi(-t_i-\lambda_i-\epsilon_i) = \Phi(-t_i-\lambda_i+\epsilon_i) + \Phi(-t_i+\lambda_i-\epsilon_i) \tag{4.8} \]

with $\lambda_i$ and $\epsilon_i$ defined as in (A) and $\Phi$ denoting the cdf of the standard
normal distribution.
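The two cutoffs of this example can be computed from raw inputs as follows. This is a minimal sketch, not the authors' code: the function names, the bisection brackets and the numerical inputs are hypothetical, the sign-change condition is taken in the exponential form obtained by dividing the normal-density condition (4.6) by $g_i(y-\Delta_2)$, and the brackets assume moderate standardized values so that the exponentials do not overflow.

```python
import math

def cdf(x):
    # Standard normal cdf.
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def bisect(fn, lo, hi, iters=200):
    # Assumes fn(lo) > 0 > fn(hi) with a single sign change in between.
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if fn(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def example_41_cutoffs(delta, eps, sigma2_i, n_i, sigma2_0, n_0,
                       g, gp, l1, l2, l3, l4):
    # Standard deviation of the difference of the two sample means.
    s = math.sqrt(sigma2_i / n_i + sigma2_0 / n_0)
    lam, e = delta / s, eps / s

    def q47(x):
        # Left side of the sign condition (4.7).
        return (l3 * g * math.exp(-2.0 * (lam + e) * x)
                + l2 * gp * (math.exp(-2.0 * e * (x - lam))
                             + math.exp(-2.0 * lam * (x - e)))
                + l4 * (1.0 - g - gp) * math.exp(-2.0 * lam * (x - e))
                - l1 * g)

    def q48(t):
        # Right side minus left side of (4.8); needs l1 = l2 = 1.
        return (cdf(-t - lam + e) + cdf(-t + lam - e)
                - cdf(t - lam - e) - l3 * cdf(-t - lam - e))

    c = bisect(q47, -20.0, 20.0)
    t = bisect(q48, 0.0, 20.0)
    return s * max(c, 0.0), s * t

# Hypothetical inputs: Delta = 1, epsilon = 0.2, sigma^2 = 4 and n = 8 for both samples.
d_gamma, d_minimax = example_41_cutoffs(1.0, 0.2, 4.0, 8, 4.0, 8,
                                        0.3, 0.5, 1.0, 1.0, 1.0, 1.0)
print(d_gamma, d_minimax)
```

Both cutoffs are returned on the scale of $\bar X_i - \bar X_0$, so treatment i would be classified as superior when the observed difference is at least the cutoff and as inferior when it is at most minus the cutoff.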


Example 4.2. Assume that $\pi_i$ represents a normal population $N(0, \sigma_i^2)$ for
i = 0, 1, ..., k with $\sigma_i^2$ unknown, and that we have a random sample of size n
taken from each population $\pi_i$. Consider a problem of partitioning the
treatment populations in terms of variances with a loss structure
analogous to that given by (2.1), i.e., a loss function obtained from the
latter by substituting $\log\sigma_i^2$, $\log\Delta$ and $\log\epsilon$ for $\theta_i$, $\Delta$ and $\epsilon$, respectively.
Thus $\Delta$ and $\epsilon$ are assumed such that $1 \le \epsilon < \Delta$. By sufficiency we need to
consider only the decision rules depending on $s_0^2, \ldots, s_k^2$ where $s_i^2$ denotes
the sample variance corresponding to $\pi_i$. Since $n s_i^2/\sigma_i^2$ (i = 0, 1, ..., k)
are independently distributed chi-square random variables with degrees of
freedom n, it can be easily seen that the associated location parameter
problem satisfies the assumptions in Theorem 4.1 except the symmetry
which is not necessary in this problem because of Remark 4.1. Therefore,
with obvious modifications we have the following results. Let $\mathcal{D}_0$ denote
the class of decision rules $\delta = (\delta_1, \ldots, \delta_k)$ for which $\delta_i$ depends only on
$s_0^2$ and $s_i^2$ and let $y_i$ denote $s_i^2/s_0^2$ for i = 1, ..., k.
* ‘4<,-»r'i><r^y,°<4,*/Z  11 1 *iYi  *s  y i>  - ci 


A,+y 


A9+y 


(4.9) 


Here $\Delta_1 = \Delta/\epsilon$, $\Delta_2 = \Delta\epsilon$.

(B) Minimax rule: Assume $\ell_1 = \ell_2 = 1$, $\ell_3 \le 1$ and $\ell_4 \le 2$. The minimax
rule $\delta^M$ in $\mathcal{D}_0$ is the same as $\delta^\Gamma$ in (A) except that $d_i = d$ is determined
so that

\[ G_n(d/\Delta_2) + \ell_3[1 - G_n(d\Delta_2)] = G_n(\Delta_1/d) + 1 - G_n(d\Delta_1) \tag{4.10} \]

where $G_n$ denotes the cdf of the F-distribution with degrees of freedom
n and n.
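Equation (4.10) can be solved numerically without special libraries. The sketch below is an illustration under stated assumptions: the cdf of the F-distribution with (n, n) degrees of freedom is obtained from the standard identity $G_n(y) = I_{y/(1+y)}(n/2, n/2)$ (the regularized incomplete beta function), integrated here by Simpson's rule with $n \ge 2$; the function names and the values $\Delta = 2$, $\epsilon = 1.25$, $n = 10$ are hypothetical.

```python
import math

def f_cdf_nn(y, n, steps=2000):
    # cdf of F(n, n) via the Beta(n/2, n/2) cdf at u = y / (1 + y),
    # integrated by Simpson's rule. Assumes n >= 2 (bounded beta density).
    if y <= 0.0:
        return 0.0
    a = 0.5 * n
    u = y / (1.0 + y)
    log_norm = math.lgamma(2.0 * a) - 2.0 * math.lgamma(a)

    def dens(x):
        if x <= 0.0 or x >= 1.0:
            return 0.0 if a > 1.0 else math.exp(log_norm)
        return math.exp(log_norm + (a - 1.0) * (math.log(x) + math.log(1.0 - x)))

    h = u / steps
    total = dens(0.0) + dens(u)
    for j in range(1, steps):
        total += (4.0 if j % 2 else 2.0) * dens(j * h)
    return total * h / 3.0

def gap_410(d, D1, D2, l3, n):
    # Left side minus right side of (4.10).
    left = f_cdf_nn(d / D2, n) + l3 * (1.0 - f_cdf_nn(d * D2, n))
    right = f_cdf_nn(D1 / d, n) + 1.0 - f_cdf_nn(d * D1, n)
    return left - right

def minimax_cutoff_410(D1, D2, l3, n, hi=1.0e3):
    lo = 1.0   # gap(1) = (1 + l3) G_n(1 / D2) - 1 < 0 when l3 <= 1 and D2 > 1
    for _ in range(60):
        mid = math.sqrt(lo * hi)      # bisection on the log scale
        if gap_410(mid, D1, D2, l3, n) < 0.0:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)

delta, eps, n = 2.0, 1.25, 10
D1, D2 = delta / eps, delta * eps     # Delta_1 = Delta / eps, Delta_2 = Delta * eps
d = minimax_cutoff_410(D1, D2, 1.0, n)
print(d)
```

The cutoff d applies to the variance ratio $s_i^2/s_0^2$ and plays the role that $d_i$ plays for differences in the location problem, so the search starts at 1 rather than 0.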

2 2 

We note that if $\pi_i$ represents $N(\mu_i, \sigma_i^2)$ with both $\mu_i$ and $\sigma_i^2$ unknown,
then the above results still hold with n-1 replacing n.

5. Comparison of Γ-minimax rules with Bayes rules.

When we represent our a priori information about the parameters by
prior distributions over the parameter space, one method for the use
of such information is to find a rule which is Γ-minimax with respect to
the class, Γ, of such prior distributions.
Another way is to select one such prior distribution
and use the corresponding Bayes rule. Thus Bayes rules wrt prior
distributions in Γ are natural competitors of a Γ-minimax rule.

In this section we consider k+1 normal populations $N(\theta_i, \sigma^2)$ with
$\sigma^2$ known, and derive Bayes rules wrt normal priors and then compare
them with the corresponding Γ-minimax rules from both points of view.




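For this purpose, assume that $(\theta_0, \ldots, \theta_k)$ have prior distribution $\tau_0$ under
which $\theta_0, \ldots, \theta_k$ are independent and each $\theta_i$ has a normal distribution
with mean $\mu_i$ and variance $\nu_i^2$. Let $\bar x_0, \ldots, \bar x_k$ denote the observed sample
means based on samples of size $n_i$ (i = 0, 1, ..., k). To simplify forthcoming
formulas, let us introduce the following notations:

\[ \sigma_i^2 = \sigma^2/n_i, \quad b_i = \big[(\sigma_i^{-2}+\nu_i^{-2})^{-1} + (\sigma_0^{-2}+\nu_0^{-2})^{-1}\big]^{1/2}, \quad m_i = (\sigma_i^{-2}\bar x_i + \nu_i^{-2}\mu_i)(\sigma_i^{-2}+\nu_i^{-2})^{-1}, \quad y_i = (m_i-m_0)/b_i. \tag{5.1} \]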
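The quantities in (5.1) are the usual normal-normal posterior summaries: $m_i$ is the posterior mean of $\theta_i$ and $b_i^2$ is the posterior variance of $\theta_i - \theta_0$. A small Python sketch, with a hypothetical function name and made-up inputs, makes the computation explicit.

```python
import math

def posterior_summary(xbar_i, xbar_0, n_i, n_0, sigma2,
                      mu_i, mu_0, nu2_i, nu2_0):
    # Returns (m_i, m_0, b_i, y_i) as in (5.1).
    def post(xbar, n, mu, nu2):
        prec_data = n / sigma2        # sigma_i^{-2}, with sigma_i^2 = sigma^2 / n_i
        prec_prior = 1.0 / nu2        # nu_i^{-2}
        var = 1.0 / (prec_data + prec_prior)
        mean = (prec_data * xbar + prec_prior * mu) * var
        return mean, var

    m_i, v_i = post(xbar_i, n_i, mu_i, nu2_i)
    m_0, v_0 = post(xbar_0, n_0, mu_0, nu2_0)
    b_i = math.sqrt(v_i + v_0)
    return m_i, m_0, b_i, (m_i - m_0) / b_i

# Hypothetical data: sample means 2.0 and 0.0, n = 4, sigma^2 = 4, N(0, 1) priors.
print(posterior_summary(2.0, 0.0, 4, 4, 4.0, 0.0, 0.0, 1.0, 1.0))
```

With these inputs the data precision and the prior precision are both 1, so each posterior mean is the average of the sample mean and the prior mean.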


The  following  theorem  describes  the  Bayes  rule. 


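Theorem 5.1. Assume the loss function is given by (2.1). Then the Bayes
rule $\delta^B$ wrt $\tau_0$ is given by $\delta_i^B(1|y) = I_{(-\infty,-d_i]}(y_i)$, $\delta_i^B(2|y) = I_{(-d_i,d_i)}(y_i)$
and $\delta_i^B(3|y) = I_{[d_i,\infty)}(y_i)$ for i = 1, ..., k where $d_i = \max(c_i, 0)$ with $c_i$ being
determined so that

\[ \ell_3\Phi(-\Delta_2 b_i^{-1}-y) + \ell_4[\Phi(-\Delta_1 b_i^{-1}-y) - \Phi(-\Delta_2 b_i^{-1}-y)] + \ell_2[\Phi(\Delta_1 b_i^{-1}-y) - \Phi(-\Delta_1 b_i^{-1}-y)] - \ell_1\Phi(-\Delta_2 b_i^{-1}+y) \ >,\ <\ 0 \quad \text{as } y <,\ > c_i. \]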
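The cutoff in Theorem 5.1 is the point where the displayed expression changes sign, so it can be found by bisection. The following sketch is illustrative only; the function name, the bracket [-10, 10] and the inputs are assumptions, and the bracket check will fail when the expression has no sign change (for instance when it is non-positive everywhere).

```python
import math

def cdf(x):
    # Standard normal cdf.
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def bayes_cutoff(D1, D2, b, l1, l2, l3, l4, lo=-10.0, hi=10.0):
    a1, a2 = D1 / b, D2 / b

    def diff(y):
        # The expression of Theorem 5.1: positive for y < c_i, negative for y > c_i.
        return (l3 * cdf(-a2 - y)
                + l4 * (cdf(-a1 - y) - cdf(-a2 - y))
                + l2 * (cdf(a1 - y) - cdf(-a1 - y))
                - l1 * cdf(-a2 + y))

    if not (diff(lo) > 0.0 > diff(hi)):
        raise ValueError("no sign change inside the bracket")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if diff(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return max(0.5 * (lo + hi), 0.0)   # d_i = max(c_i, 0)

# Hypothetical inputs: Delta_1 = 0.8, Delta_2 = 1.2, b_i = 1, all loss constants 1.
print(bayes_cutoff(0.8, 1.2, 1.0, 1.0, 1.0, 1.0, 1.0))
```

When all four loss constants equal 1 the expression telescopes to $\Phi(\Delta_1 b_i^{-1} - y) - \Phi(y - \Delta_2 b_i^{-1})$, so the root is $c_i = \Delta/b_i$, which gives a convenient sanity check.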

Proof. It suffices to find the Bayes rule for each of the k component
decision problems. This reduces to the comparison of posterior risks of
three possible actions. We will do this for the first component decision
problem without loss of generality. Let $\rho_1(y_1)$, $\rho_2(y_1)$ and $\rho_3(y_1)$ denote
the posterior risks of the actions 1, 2 and 3, respectively, in the first
component problem. Then it can be shown that

\[ \begin{aligned} \rho_1(y) &= (\ell_1+\ell_3)\Phi(-\Delta_2 b_1^{-1}+y) + \ell_4[\Phi(\Delta_2 b_1^{-1}-y) - \Phi(\Delta_1 b_1^{-1}-y)] + \ell_2[\Phi(\Delta_1 b_1^{-1}-y) - \Phi(-\Delta_1 b_1^{-1}-y)], \\ \rho_2(y) &= \ell_1[\Phi(-\Delta_2 b_1^{-1}-y) + \Phi(-\Delta_2 b_1^{-1}+y)] \quad \text{and} \\ \rho_3(y) &= \rho_1(-y). \end{aligned} \]

Note that $\rho_1(y) - \rho_3(y)$ can be written as $E_y H(Z)$ where Z has a normal
distribution with mean y and variance 1 and $H(\cdot)$ is given by

\[ H(z) = \begin{cases} \ell_1+\ell_3 & \text{if } z \ge \Delta_2 b_1^{-1}, \\ \ell_4 & \text{if } \Delta_1 b_1^{-1} \le z < \Delta_2 b_1^{-1}, \\ 0 & \text{if } -\Delta_1 b_1^{-1} < z < \Delta_1 b_1^{-1}, \\ -\ell_4 & \text{if } -\Delta_2 b_1^{-1} < z \le -\Delta_1 b_1^{-1}, \\ -(\ell_1+\ell_3) & \text{if } z \le -\Delta_2 b_1^{-1}. \end{cases} \]

Since the density of the normal distribution N(y,1) has the MLR property,
it follows that $\rho_1(y) - \rho_3(y)$ has at most one sign change.
Furthermore, it can be shown that $\rho_1(y) - \rho_3(y)$ is strictly increasing on
$(-\Delta b_1^{-1}, \Delta b_1^{-1})$ and $\rho_1(0) - \rho_3(0) = 0$. Thus $\rho_1(y) - \rho_3(y) >, < 0$ as $y >, < 0$.
Similarly, we can show that $\rho_3(y) - \rho_2(y) >, < 0$ as $y <, > c_1$ for some real
number $c_1$ unless $\rho_3(y) - \rho_2(y) \le 0$ for all y. Therefore the result follows.

Now we compare the Γ-minimax rule $\delta^\Gamma$ given in Example 4.1 and the
Bayes rule $\delta^B$ given in Theorem 5.1 under the assumption that $\ell_1 = \ell_2 = \ell_4 = 1$,
$\ell_3 = \ell$, $n_i = n$ and $\nu_i^2 = \nu^2$ for i = 0, ..., k. Note that we compare these
rules under the relations

\[ \gamma_i = \Phi[(-\Delta_2+\mu_i-\mu_0)(2\nu^2)^{-1/2}] + \Phi[(-\Delta_2-\mu_i+\mu_0)(2\nu^2)^{-1/2}] \]

and

\[ \gamma_i' = \Phi[(\Delta_1-\mu_i+\mu_0)(2\nu^2)^{-1/2}] - \Phi[(-\Delta_1-\mu_i+\mu_0)(2\nu^2)^{-1/2}] \]

for i = 1, ..., k. Each rule is optimal under its own criterion, so there are
two meaningful ways to compare them. One way is to examine the increase
in the overall risk wrt $\tau_0$ resulting from the use of $\delta^\Gamma$. Another way is
to compare them in terms of $\sup_{\tau\in\Gamma} r(\tau,\delta)$. When $n_i = n$ and $\nu_i^2 = \nu^2$ for
i = 0, 1, ..., k, the Bayes rule depends on x only through $\bar x_1-\bar x_0, \ldots, \bar x_k-\bar x_0$ and
it can be shown that $\sup_{\tau\in\Gamma} r(\tau,\delta^B) = \sum_{i=1}^{k} \sup_{\tau\in\Gamma} r_i(\tau,\delta_i^B)$. Thus it suffices to
compare these rules wrt classification of one population. We choose $\pi_1$ for
this purpose without loss of generality.
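The prior masses $\gamma_i$ and $\gamma_i'$ implied by independent normal priors follow from the fact that $\theta_i - \theta_0$ is then normal with mean $\mu_i - \mu_0$ and variance $2\nu^2$. A short sketch, with a hypothetical function name and inputs:

```python
import math

def cdf(x):
    # Standard normal cdf.
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def prior_weights(delta, eps, mu_i, mu_0, nu2):
    # gamma_i  = P(|theta_i - theta_0| >= Delta_2),
    # gamma_i' = P(|theta_i - theta_0| <  Delta_1) under the normal prior.
    sd = math.sqrt(2.0 * nu2)          # sd of theta_i - theta_0
    shift = mu_i - mu_0
    d1, d2 = delta - eps, delta + eps
    g = cdf((-d2 + shift) / sd) + cdf((-d2 - shift) / sd)
    gp = cdf((d1 - shift) / sd) - cdf((-d1 - shift) / sd)
    return g, gp

# Hypothetical inputs: Delta = 1, epsilon = 0.2, equal prior means, nu^2 = 0.5.
print(prior_weights(1.0, 0.2, 0.0, 0.0, 0.5))
```

With $\epsilon = 0$ the two masses add to one, in line with the note following (2.3).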

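Now we introduce the parameters used in the comparison as follows.

\[ \beta_1 = \frac{n\nu^2}{\sigma^2}, \quad \beta_2 = \frac{\Delta}{\sqrt{2}\,\nu}, \quad \beta_3 = \frac{\epsilon}{\sqrt{2}\,\nu} \quad \text{and} \quad \beta_4 = \frac{\mu_1-\mu_0}{\sqrt{2}\,\nu}. \]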

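It can be verified that the overall risk wrt $\tau_0$ of these rules can be
written as

\[ \begin{aligned} &\ell\Phi(-A-B-C) + \Phi(A-B-C) + \Phi(D-E) + \ell\Phi(-D-E) \\ &\quad - \Phi_0(-A-B-C, -D-E; \rho) + (1-\ell)\Phi_0(-A-B-C, D-E; \rho) \\ &\quad - \Phi_0(-A+B-C, D-E; \rho) + \Phi_0(A-B-C, -D-E; \rho) \\ &\quad - \Phi_0(-A+B-C, -D-E; \rho) - \Phi_0(A-B-C, D-E; \rho) \\ &\quad + \Phi_0(-A+B-C, D-E; \rho) - \Phi_0(A-B-C, -D-E; \rho) \\ &\quad - \Phi_0(A+B-C, D-E; \rho) - (\ell-1)\Phi_0(A+B-C, -D-E; \rho) \end{aligned} \]

where $\Phi_0(\cdot,\cdot;\rho)$ is the cdf of a bivariate normal distribution with zero
means, unit variances and correlation coefficient $\rho$, and where $A = \beta_2$,
$B = \beta_3$, $C = \beta_4$, $\rho = \beta_1^{1/2}(1+\beta_1)^{-1/2}$, $D = d_1\beta_1^{-1/2}$ for $\delta^B$, $D = \max(c_1,0)(1+\beta_1)^{-1/2}$
for $\delta^\Gamma$, $E = \rho^{-1}C$ for $\delta^B$ and $E = \rho C$ for $\delta^\Gamma$ with $d_1$ and $c_1$ being those in
Theorem 5.1 and Example 4.1, respectively. Also $\sup_{\tau\in\Gamma} r_1(\tau,\delta_1)$ for both rules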

t6F 
can be written as

\[ \begin{aligned} &\gamma_1[\Phi(R+|S|-T-U) - \Phi(-R+|S|-T+U) + \ell\Phi(-R+|S|-T-U)] \\ &\quad + \gamma_1'\{[\Phi(-R-S+T-U) + \Phi(-R+S-T+U)] \vee [\Phi(-R+S+T-U) + \Phi(-R-S-T+U)]\} \\ &\quad + (1-\gamma_1')\Phi(-R+|S|-T+U) \end{aligned} \]

where $x \vee y = \max(x,y)$, $T = \beta_2\beta_1^{1/2}$, $U = \beta_3\beta_1^{1/2}$, $S = \beta_4\beta_1^{-1/2}$ for $\delta^B$, $S = 0$
for $\delta^\Gamma$, $R = d_1(1+\beta_1^{-1})^{1/2}$ for $\delta^B$, $R = \max(c_1, 0)$ for $\delta^\Gamma$ with $d_1$ and
$c_1$ being those in Theorem 5.1 and Example 4.1, respectively. For
selected values of $\beta_i$ (i = 1, ..., 4), Table I and Table II give $r_1(\tau_0,\delta_1)$
and $\sup_{\tau\in\Gamma} r_1(\tau,\delta_1)$ for $\delta_1 = \delta_1^\Gamma, \delta_1^B$ for $\ell = 0$ and $\ell = 1$, respectively. It
can be observed from these tables that, in many cases, the increase in the
overall risk wrt $\tau_0$ from the use of $\delta_1^\Gamma$ is only slight compared to that
in $\sup_{\tau\in\Gamma} r_1(\tau,\delta_1)$ from the use of $\delta_1^B$. In this sense, $\delta^\Gamma$ is more robust
to a change of criterion than $\delta^B$. Such properties of $\delta^\Gamma$ become more
prominent as the difference between the prior means ($\beta_4$) increases and
the prior variance ($\beta_1$) gets smaller. When we have the same prior means
and the prior variance is large, the two rules perform comparably.
In most cases, we can observe that $\delta^\Gamma$ compares favorably with $\delta^B$
in terms of the overall risk.
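The expression for $\sup_{\tau\in\Gamma} r_1(\tau,\delta_1)$ is a weighted sum of three worst-case risks: the $\gamma_1$ part is the risk at $|\theta_1-\theta_0| = \Delta_2$, the $\gamma_1'$ part is the larger of the two endpoint risks of the equivalence zone, and the remaining mass $1-\gamma_1-\gamma_1'$ goes to the worst case in the indifference zone. The sketch below evaluates it for given standardized inputs R, S, T, U; the function name and the numerical values are hypothetical, and $\ell_1 = \ell_2 = \ell_4 = 1$, $\ell_3 = \ell$ is assumed as in the comparison.

```python
import math

def cdf(x):
    # Standard normal cdf.
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def sup_risk(R, S, T, U, g, gp, ell):
    # sup over Gamma of r_1 for a rule with standardized cutoff R and shift S.
    a = abs(S)
    indiff = cdf(-R + a - T + U)                        # worst case, indifference zone
    extreme = cdf(R + a - T - U) + ell * cdf(-R + a - T - U)
    inner = max(cdf(-R - S + T - U) + cdf(-R + S - T + U),
                cdf(-R + S + T - U) + cdf(-R - S - T + U))
    return g * (extreme - indiff) + gp * inner + (1.0 - gp) * indiff

# Hypothetical standardized inputs.
print(sup_risk(1.2, 0.0, 1.0, 0.2, 0.3, 0.5, 1.0))   # a rule with S = 0
print(sup_risk(1.2, 0.3, 1.0, 0.2, 0.3, 0.5, 1.0))   # a rule shifted by S = 0.3
```

Setting S = 0 corresponds to a rule that is symmetric in the observed difference, as the Γ-minimax rule is, while a nonzero S reflects the shift that unequal prior means induce in the Bayes rule.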